WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PA TENT COOPERATION TREATY (PCT) 

WO 00/39332 

6 July 2000 (06.07.00) 



(51) International Patent Classification 7 
C12Q 1/68 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



(21) International Application Number: PCT/GB99/04380 

(22) International Filing Date: 22 December 1999 (22.12.99) 



(30) Priority Data: 

9828619.8 



23 December 1998 (23.12.98) GB 



(71) Applicant (for all designated States except US): JANSSEN 

PHARMACEUTICA N.V. [BE/BE]; Tumhoutsewcg 30, 
B-2340 Beerse (BE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): PAULUSSEN, Aimee, 
Dymphne, Catherine [NlVBE]; Janssen Pharmaceutica N.V., 
Turnhoutseweg 30, B-2340 Bcerse (BE). ARMSTRONG, 
Martin [GB/GB]; Pippins, Grove Road, Wickambreaux, 
Canterbury, Kent CT3 1SJ (GB). 

(74) Agent: BOULT WADE TENNANT; Verulam Gardens, 70 
Gray's Inn Road, London WC1X 8BT (GB). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA BB BG 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK DM EE 
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID IL IN IS JP 
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV MA* 
MD, MG, MK, MN, MW, MX, NO, NZ, PL PT RO RU 
SD, SE, SG, SI, SK, SL, TJ, TM, TR T TT, TZ, UA* UG 
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH. GM KE* 
LS, MW, SD, SL, SZ, TZ. UG, ZW), Eurasian patent (AM* 
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT LU 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, d'cM* 
GA, GN, GW, ML, MR, ME, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of (he time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: GENOTYPING CYTOCHROME EXPRESSION 
(57) Abstract 

There is disclosed a method of identifying subjects having a high or low drug metabolising phenotype associated with cytochrome 
CYP3A5 expression, which method comprises screening genomic DNA from said subject for the presence or absence of one or more 
polymorphic vanants in a transcription regulatory region of the sequence encoding CYP3A5. Oligonucleotide molecules for carrvin^ out 
the screening are also provided. ° 



FOR THE PURPOSES OF INFORMATION ONLY 
Co*. «, t0 identif y Stat, party to the PCTon the front pagcs of pamphlcts P u bIishing intemational applications under ^ pcr 



AL 


Albania 


ES 


AM 


Armenia 


FI 


AT 


Austria 


FR 


AU 


Australia 


GA 


AZ 


Azerbaijan 


GB 


BA 


Bosnia and Herzegovina 


GE 


BB 


Barbados 


GH 


BE 


Belgium 


GN 


BF 


Burkina Faso 


GR 


BG 


Bulgaria 


HL> 


BJ 


Benin 


IE 


BR 


Brazil 


IL 


BY 


Belarus 


IS 


CA 


Canada 


IT 


CF 


Central African Republic 


JP 


CG 


Congo 


KE 


CH 


Switzerland 


KG 


CI 


Cote d'lvoire 


KP 


CM 


Cameroon 




CH 


China 


KR 


cu 


Cuba 


KZ 


cz 


C2cch Republic 


LC 


DE 


Germany 


U 


DK 


Denmark 


LK 


EE 


Estonia 


LR 



Spain 
Finland 
France 
Gabon 

United Kingdom 

Georgia 

Ghana 

Guinea 

Greece 

Hungary 

Ireland 

Israel 

Iceland 

Italy 

Japan 

Kenya 

Kyrgyzstan 

Democratic People's 

Republic of Korea 

Republic of Korea 

Kazakstan 

Saint Lucia 

Liechtenstein 

Sri Lanka 

Liberia 



IS 
LT 
LU 
LV 
MC 
MD 
MG 
MK 



Lesotho 

Lithuania 

Luxembourg 

Latvia 

Monaco 

Republic of Moldova 
Madagascar 
The former Yugoslav 
Republic of Macedonia 



ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


FT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 



SI 
SK 
SN 

sz 

TD 
TG 
TJ 
TM 
TR 
TT 
UA 
UG 
US 

uz 

VN 
YU 
ZW 



Slovenia 

Slovakia 

Senegal 

Swaziland 

Chad 

Togo 

Tajikistan 

Turkmenistan 

Turkey 

Trinidad and Tobago 

Ukraine 

Uganda 

United States of America 

Uzbekistan 

Viet Nam 

Yugoslavia 

Zimbabwe 



(5/f 



09/ 869169 



WO 00/39332 



PCT/GB99/O4380 



- 1 - 



iC03 Rsc'd PCT 



TC 2 2 JUN 200? 



GENOTYPING CYTOCHROME EXPRESSION 



The present invention is concerned with an assay and, 
in particular, with an assay for genotyping a 
polymorphism predictive of a phenotype associated with 
cytochrome expression, in this case CYP3A5. 

The cytochrome P450 subfamily CYP3A represents one of 
the most important families of the P4 50 super family 
and plays a major role in the metabolism of an ever 
expanding list of therapeutic compounds (23, 24). This 
family comprises the most abundantly expressed P450s 
in human livers, and is responsible for the metabolism 
of over 50% of all clinically used drugs, incXuding 
the dihydropyridines , cyclosporin, erythromycin and 
barbiturates (1) . Wide inter-individual variation in 
the metabolism of CYP3A substrates has been noted and 
is a factor in determining individual drug efficacy. 
Evidence also exists for the metabolism of an array of 
lipophilic environmental pollutants, including the 
activation of pro-carcinogens such as aflatoxin Bl by 
members of this subfamily (2) . 

Presently, four CYP3A cDNAs have been identified in 
humans, CYP3A3, CYP3A4 , CYP3A5 and CYP3A7. it is 
believed that CYP3A3 represents an allelic variant of 
CYP3A4, whilst CYP3A4 and CYP3A7 are found only in 
human adult and fetal livers respectively (3) - Initial 
experiments suggested that a polymorphism existed in 
CYP3A4 (4) . However other studies, whilst confirming a 
wide range of inter-individual variation in C^tfP3A4 
expression have failed to confirm the original! 
bimodality (5, 6) . Overlapping substrate specificities 
between CYP3A5 and CYP3A5 have previously mads it 
difficult to separate metabolism by these isorforms; 
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consequently little phenotyping da~a have been 
produced to study variation in CYP3A5 activity in 
humans. However, there is evidence for the polymorphic 
express ion of CYP3A5. Use of both immunoblot ting and 
Northerrn analysis have detected CYP3A5 expression in 
only 10-30% of human livers (7, 8, 9). More recently, 
analysis £ of 30 human liver samples using 
immunob Jotting found that only 3% showed no detectable 
CYP3A5 , whilst a large number had trace amounts, 
suggesting that a polymorphism in this enzyme may be 
regulatory as opposed to structural (10). 
Comparisons of the 5' flanking regions from the 
CYP3A4, 3A5 and 3A7 genes have identified putative 
binding sites for several transcriptional regulatory 
factors common to all isoforms (11, 12, 13) . However, 
the molecular basis, if any, for this inter-individual 
variation in expression of the CYP3A sub-family 
members has so far remained unclear. Indeed it has 
been suggested that the host cellular environment may 
be a greater determinant of inducibility than gene 
structure (14) . However, the determination of a major 
genetic component to variant expression and activity, 
linked to an easy screening method, would be extremely 
beneficial, not only in providing a predictor of 
individual response to drugs which are metabolised by 
these isoforms, but also in facilitating association 
studies between CYP3A and disease processes. 

The delineation of CYP3A4 and CYP3A5 metabolism has 
been shown to be possible using the sedative midazolam 
as a probe drug (15). In this case two metabolites are 
formed, 1-hydroxy midazolam (1-OHM) and 4-hydroxy 
midazolam (4-OHM) . Those samples containing a higher 
proportion of CYP3A5 compared to CYP3A4 have their 
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metabolism driven towards the 1-OHM route and 
therefore show a higher ratio of 1-OHM/4-OHM than 
those containing only CYP3A4 . The present inventors 
have now established that two polymorphisms, located 
in putative transcriptional regulatory regions, which 
caused increased CYP3A5 gene expression and metabolic 
activity are linked and have developed assays for 
their detection. These assays will allow prediction 
of intexr-individual variability in response to drugs 
metabolised by this isoform, as well as facilitating 
disease association studies. 

Therefore, according to a first aspect of the present 
invention there is provided a method of identifying 
subjects having a high or low drug metabolising 
phenotype associated with cytochrome CYP3A5 
expression, which method comprises screening for the 
presence or absence in the genome of a subject a 
polymorphic variant in a transcription regulatory 
region , such as, a promoter or enhancer adjacent the 
region encoding CYP3A5. Preferably, the method 
involves screening for a variant in a recognition site 
for a transcription factor of said regulatory region, 
and even more preferably in an activator protein-3 
motif or a basic transcription element. Even more 
preferably, the method involves screening for a 
variant at any one of positions -475 or -147 of the 
DNA of the 5' flanking region adjacent to the region 
encoding CYP3A5 the sequence of which flanking region 
is illustrated in Figure, 7 and preferably, for both 
the variants at positions -475 and -147. 

In one embodiment of the method of the invention 
genomic DNA is amplified, preferably by the polymerase 
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chain zreaction using oligonucleotide molecules capable 
of hybrr idising selectively to the wild type sequence 
or the variant sequences, such that generation of 
amplifi_ ed DNA from said molecules will indicate 
whether said wild type or mutation is present. in 
this method PCR primers hybridise either to the 
mutated or wild type sequence, but not both. 
Amplification of the DNA of the respective mutation or 
wild type genotype using the respective primers will 
provide an indication of the presence of the wild type 
or mutated nucleotide mutations. 



A further method of the invention advantageously 
utilises oligonucleotide molecules as primers which, 
15 in addition to hybridising to the site of interest, 

are capable of introducing a restriction site which is 
absent in either the wild type sequence or polymorphic 
variants. Therefore, according to a further aspect of 
the invention, there is provided a method of 
20 identifying subjects having a high or low drug 
metabolising phenotype associated with CYP 3A5 
expression, which method comprises 1) amplifying 
genomic DNA from a subject using oligonucleotide 
molecules capable of hybridising to the wild type 
25 sequence and/or to the polymorphic variant sequence at 
a location being analysed, which molecules are such 
that they can introduce a restriction site at said 
location which is not present in the wild type or 
variant sequences, and 2) subjecting amplified DNA 
30 from step 1 to a restriction enzyme which cleaves the 
DNA at said restriction site to provide a restriction 
digest indicative of the presence or absence of said 
variant. 



35 



The method preferably comprises amplifying DNA 
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recognition site for a transcription factor of said 
regulatory region and preferably in an activator 
protein- 3 motif (AP-3) and/or basic transcription 
element ( BTE) . Preferably, the method comprises 
5 amplifying DNA spanning any of position -475 or -147, 
of the regulatory region of CYP 3A5, the sequence of 
which is illustrated in Figure 7. 

The polymorphisms at the positions identified in each 
10 of the methods according to the invention comprise 
T- 4 75 - G and A_, 47 - G. As presented in the Examples 
below, the molecule used to detect the variation at 
A-147 "* G is capable of introducing a restriction site 
for the enzyme Tai I only when the wild type A 
15 nucleotide is present at position -147. 

Alternatively, the molecule used to detect the 
T. 475 - G nucleotide variation is capable of 
introducing a restriction site for the enzyme Alu I 
only when the wild type T nucleotide is present at 
20 position -475. 

In this embodiment an example of suitable primers is 
any of 3A5F1 GGGTCTGTCTGGCTGCGC 
and 3A5F2 (GGGGTCTGTCTGGCTGAGC) 
25 and 3A5R1 (TTTATGTGCTGGAGAAGGACG) . 

Using oligonucleotide mismatch primer 3A5R1 creates a 
Ta.i I recognition site only when the wild type A 
nucleotide is present at position -147. Digestion of 

30 the 369bp product with Tai I yields fragments of 34 9 
and 20bp for the wild type sequence, whilst tine 
product remains undigested if a mutant, such as the G 
nucleotide, is present (Figure 2). Similarly, for the 
detection of the T..- 5 G mutation a second 

35 oligonucleotide mismatch primer 3AF2 may be used. 
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This pr imer introduces a recognition site for the 
restriction enzyme Alu I when the wild type T is 
present at position -475, digesting the product to 
yield fragments of 318, 33 and 18bp. This site is 
lost when the mutant G nucleotide is present, yielding 
digestion products of 336 and 33bp (Figure 3) . 

Known techniques for the scoring of single nucleotide 
polymorphisms (see review by Schafer, A. J. and 
Hawkins, J. R. in Nature Biotechnology, Vol 16, pp33- 
39 (1998) include mass spectrometry, particularly 
matrix-assisted laser desorption/ionization time-of- 
f light mass spectrometry (MALDI-TOF-MS, se Roskey, M. 
T. et.al-, 1996, PNAS USA, 93: 4724-4729), single 
nucleotide primer extension (Shumaker, J. m. et.al., 

1996, Hum. Mutat., 7: 346-354; Pastinen, T. et.al., 

1997, Genome Res., 7: 606-614) and DNA chips or 
microarrays (Underhill, P. A . et.al., 1996, PNAS USA, 
93: 196-200; Gilles, P. N. et.al. Nat. Biotech., 1999, 
17: 365-370). The use of DNA chips or microarrays 
could enable simultaneous genotyping at many different 
polymorphic loci in a single individual or the 
simultaneous genotyping of a single polymorphic locus 
in multiple individuals. 

In addition to the above, SNPs are commonly 
scored using PCR-SSCP based techniques, such as PCR- 
SSP using allele-specific primers (described by Bunce, 
1995) . If the SNP results in the abolition or 
creation of a restriction site then genotyping can be 
carried out by performing PCR using non-allele 
specific primers spanning the polymorphic site and 
digesting the resultant PCR product using the 
appropriate restriction enzyme. The known techniques 
for scoring polymorphisms are of general applicability 
and it would therefore be readily apparent to persons 
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skilled in the art that the known techniques could be 
adapted for the scoring of single nucleotide 
polymorphisms in the the regulatory region of CYP 3A5. 

As woulci be readily apparent to those skilled in the 
art, genotyping is generally carried out on genomic 
DNA prepared from a suitable tissue sample obtained 
from the subject under test. Most preferably, genomic 
DNA is prepared from a blood sample, according to 
standard procedures which are well known in the art 



Also provided by the present invention is an 
oligonucleotide of at least 10 contiguous nucleotides 
to detect polymorphic variants in a 5' regulatory 
15 region adjacent the sequence encoding cytochrome 

CYP3A5 associated with a high or low drug metabolising 
phenotype. The oligonucleotide is capable of 
hybridising to a region incorporating either a mutated 
or wild type nucleotide at position -475 or -147 of 
said flanking region, such that amplification of said 
positions will or will not proceed from said rrimer 
according to whether or not a polymorphic variant 
occurs at any of said positions. 

25 The oligonucleotide molecules of the invention 

^ are preferably from 10 to 50 nucleotides in length, 

even more preferably from 20-30 nucleotides ir. length, 
and may be DNA, RNA or a synthetic nucleic acid, and 
may be chemically or biochemically modified or may 

30 contain non-natural or derivatized nucleotide bases, 

as will be readily appreciated by those skilled in the 
art. Possible modifications include, for exarr.ple, the 
addition of isotopic or non-isotopic labels, 
substitution of one or more of the naturally occurring 

35 nucleotide bases with an analog, internucleoti.de 
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modifications such as uncharged linkages (e.g. methyl 
phosphonates, phosphoamidates , carbamates, etc.) or 
charged linkages (e.g. phosphoro thioates , 
phospho zrodithioates, etc.). Also included are 
synthetic molecules that mimic polynucleotides in 
their ability to bind to a designated sequence to form 
a stable hybrid. Such molecules are known in the art 
and include, for example, so-called peptide nucleic 
acids (PNAs) in which peptide linkages substitute for 
phosphate linkages in the backbone of the molecule. 
An oligonucleotide molecule according to the invention 
may be produced according to techniques well known in 
the art, such as by chemical synthesis or recombinant 
means . 

The oligonucleotide molecules of the invention 
may be double stranded or single stranded but are 
preferably single stranded, in which case they may 
correspond to the sense strand or the antisense strand 
of the 5 r regulatory region of CYP3A5 . The 
oligonucleotides may advantageously be used as probes 
or as prirr.ers to initiate DNA synthes is/DNA 
amplification. They may also be used in diagnostic 
kits or the like for detecting the presence of one or 
more variants alleles of the regulatory region of 
CYP3A5. These tests generally comprise contacting the 
probe with a sample of test nucleic acid (usually 
genomic DNA) under hybridising conditions and 
detecting for the presence of any duplex or triplex 
formation between the probe and complementary nucleic 
acid in the sample. The^ probes may be anchored to a 
solid support to facilitate their use in the detection 
of these variants. Preferably, they are present on an 
array so that multiple probes can simultaneously 
hybridize :o a single sample of target nucleic acid. 
The probes can be spotted onto the array or 
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synthesi-sed in situ on the array. (See Lockhart et 
al. f Nature Biotechnology, vol. 14, December 1996 
"Expression monitoring by hybridisation to high 
density oligonucleotide arrays". A single array can 
5 contain more than 100, 500 or even 1,000 different 
probes in discrete locations. Preferably, the 
oligonucleotides comprise any of the primers 3A5F1, 
3A5F2 and 3A5R1 as defined herein. 



10 Also provided is a kit to perform the method according 
to the invention. Preferably, the kit will comprise 
an oligonucleotide as described herein and even more 
preferably the kit will further comprise one or more 
restriction enzymes capable of distinguishing between 

15 wild-type or polymorphic variants as defined bierein. 

Preferably, the restriction enzyme comprises Tai I or 
A2u I. 



According to a further aspect of the invention there 

20 is also provided a method of identifying toxic or 

mutagenic effects of a test compound, such as, a drug, 
toxin or procarcinogen metabolised by CYP3A5 the 
method comprising contacting each of a cell having a 
high drug metabolising phenotype and a cell having a 

25 low metabolising phenotype associated with cytochrome 
CYP3A5 expression, with said test compound and 
identifying the effects of said compound on each of 
said high or low drug metabolising phenotype cells or 
other cells sensitive to said compound. An even 

30 further aspect comprises »a method of diagnosing 
susceptibility of an individual to a disease 
associated with environmental toxins or procar cinogens 
metabolised by CYP3A5, the method comprising the steps 
of 1) providing a sample containing DNA, and 2 ) 

35 identify! g the presence or absence of a mutation in a 
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transcription regulatory region adjacent to the DNA 
sequence encoding CYP3A5 using a reagent capable of 
distinguishing the presence or absence of a nucleotide 
in said regulatory site. According to this aspect of 
5 the invention./ the mutation occurs in a recognition 
site for" a transcription factor of said regulatory 
region and preferably in an activator protein-3 motif 
(AP-3) and/or a basic transcription element (BTE) , 
Preferably, the mutation occurs at any of positions - 
10 475 and "147 of the regulatory region and even more 

preferably at both positions where the mutation may be 
T_ 475 G or A. :4 -G. 

Advantageously, it is also envisaged that the 
15 regulatory region of the 5* flanking region can be 

used to identify or purify transcription factors which 
bind to the 5' region including the respective 
polymorphic variants. Thus, according to a further 
aspect of the invention, there is provided a method of 
20 identifying transcription factors capable of binding 
to a DNA fragment from a transcription regulatory 
region adjacent DNA encoding cytochrome CYP3A5, said 
method comprising contacting said DNA fragment 
including said transcription regulatory region with 
25 potential transcription factors and identifying any 

transcription factor complexed to said DNA fragments. 

Using the transcription regulatory fragment it is 
possible tc identify compounds or agents which exhibit 

30 or exert their effect on the transcription regulatory 
region of CYP3A5. Thus, \ there is provided according 
to this aspect of the invention a method of 
identifying compounds acting on a transcription 
regulatory region adjacent to a DNA sequence encoding 

35 CYP3A5, the rtethod comprising transforming a cell with 
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a DNA construct comprising the sequence of said 
regulatory region, and which regulatory region is 
operably linked to a sequence encoding a reporter 
molecule, contacting said cell with a test compound 
5 and identifying any expression of said reporter 

molecule. Preferably, said cell is expressing CYP3A5 
or is showing CYP3A5 activity. 



Also provided by the invention is a method of 
10 purification of transcription factors from a sample 
which aire capable of binding to DNA from a 
transcription regulatory region adjacent a DNA 
sequence encoding cytochrome CYP3A5, the method 
V, ] comprising contacting a DNA fragment including said 

15 transcriptional regulatory region with a mixture of 

transcription factors and identifying any complexes of 
said transcription factors and said fragment . 



An even further aspect of the invention comprises a 

20 method of providing a measure of activity of a 
transcription regulatory region adjacent to DNA 
encoding cytochrome CYP3A5 or alternatively a method 
of identifying a mutation which alters the activity of 
the transcription regulatory region the method 

25 comprising providing a DNA construct having a sequence 
encoding a reporter molecule operably linked to a DNA 
fragment comprising said regulatory region, and 
introducing said construct into a cell and monitoring 
for the level of expression of said reporter molecule. 

30 When the method is used* to identify a variant which 
alters the activity of the transcription regulatory 
control region, the method may include the further 
step of comparing the levels of expression of a wild 
type and a polymorphic regulatory region as described 

35 herein. 
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According to each of the aspects of the invention, the 
reguianory region includes a polymorphic variation, 
preferably in a recognition site for a transcription 
factor of said regulatory region, and preferably in an 
activator protein-3 motif (AP-3) and/or a basic 
transcription element (BTE) . In a preferred 
embodiment the variant occurs at position -475 or -147 
of the rregion flanking the sequence encoding CYP3A5, 
and which region is illustrated in Figure 7. 
Preferably, both the variants are present. 

The methods of the present invention will be 
particularly valuable to establish, prior to treatment 
with a drug, whether the drug will be effectively 
metabolised by the patient. 

The invention may be more clearly understood by the 
following example with reference to the accompanying 
drawings wherein 



Fig. la: is an illustration of the relationship 
between midazolam metabolic ratio and 
genotype for the linked A_ : , 7 G and T_ 47S G 
mutations in the 5' flanking region of the 
CYP3A5 gene. Midazolam metabolic ratio l- 
OKM/4-OHM, wt - samples with the wild type 
sequence in the 5' flanking region as 
previously published (11), Het = samples 
heterozygous for the linked polymorphisms, 
A. : . 7 G and T„ 475 G ; 

Fig. lb: is an illustration of the relationship 
between CYP3A5 mRNA expression and the 
linked A„ U7 G and T. 475 G mutations in the 5' 
flanking region of CYP3A5. Relative Ct 
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difference = difference in threshold cycle 
between samples, as described in the methods 
section wt = samples with the wild type 
sequence in the 5' flanking region as 
previously published (11) Het = samples 
heterozygous for the linked polymorphisms, 
A_ 147 G and T. 475 G . 

Fig. 2a: is a diagram of relative position of 

primers, and of the recognition site for the 
restriction enzyme Tai I, which is 
introduced into the PCR product utilising 
mismatched primer 3A5R1 when the wild-type 
"A" nucleotide is present at position -147, 
and is lost when the mutant "G" nucleotide 
is present. 



Fig. 2b: is a diagrammatical representation of 

expected restriction fragments for each 
possible genotype for the A„ : , ? G variant, 
i.e. homozygous wild-type, heterozygous and 
homozygous mutant . 



Fig. 2c: is an illustration of a 1.5% agarose gel of 
Tai I restriction digest of 3A5F2/3A5R1 PCR 
product for detection of the A. 147 G variant. 
Lane 1. 100 bp ladder. Lanes 2 & 7. 
Reference undigested PCR products. Lane 3. 
Sample homozygous for the wild-type "A" 
nucleotide at position -147. Lanes 10, 11, 
16. Samples heterozygous for the A. 147 G 
variant . 



Fig. 3a: is a diagram of relative position of 
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primers, and of the recognition sices for 
the restriction enzyme Alu I. The forward 
recognition site is introduced into the PCR 
product utilising mismatched primer 3A5F2 
5 when the wild-type M T" nucleotide is present 

at position -475, and is lost when the 
variant "G" nucleotide is present. 

Fig. 3b: is a diagrammatical representation of 
10 expected restriction fragments for each 

possible genotype for the T„ 47 = G variant, 
i.e. homozygous wild-type, heterozygous and 
homozygous mutant . 

15 Fig. 3c: is an illustration of a 12.5% 

polyacrylamide ExcelGel of Alu I restriction 
digest of the 3A5F2/3A5R1 PCR product for 
detection of the T.< 75 G mutation. Lane 1.100 
bp ladder. Lanes 2, 5, 6, 7 & 8. Samples 

20 homozygous for the wild-type "T" nucleotide 

at position -147. Lanes 3, 4, 9. Samples 
heterozygous for the T_< 75 G mutation. 
Fragment X - additional digestion product 
resulting from re-amplification of original 

25 template by primers 3A51/3A52. 

Fig. 4a: is a comparison of 1-0HM/4-0HM metabolic 
ratios between samples with the linked 
mutations (HET group) and those wild-type 
30 for the mutations at positions -147 and -475 

(WT group) . Mean and quartiles are shown for 
each group, as is overall mean for the 
combined groups (central line). 



35 Fig. 4b: 



is a comparison of CYP3A5 expression (In 
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transformed) between samples with the linked 
mutations (HET group) and those wild-type 
for the mutations at positions -147 and -475 
(WT group) . Mean and quartiles are shown for 
5 each group, as is overall mean for the 

combined groups (central line) . 

Fig. 5: is a Western blot analysis of CYP3A5 protein 
expression in liver samples. A Western blot 

10 of microsomes prepared from liver samples 

and probed with a CYP3A5 specific antibody. 
Liver samples containing the linked 
polymorphisms at -147 and -475 (wt group) 
are marked * (sizes indicated in kDa from 

15 Wide Range Colour Marker (signs)). 

Fig. 6: is a list of oligonucleotide mismatch 
primers used in accordance with the 
invention, where the underlined nucleotide 
20 indicates the sequence mismatch. 

Fig. 7: is an illustration cf the nucleotide 

sequence of the 5 r flanking region relative 
to the DNA sequence encoding CYP3A5 . 

25 

Fig. 8: is an illustration of the results obtained 

from an Electrophorer ic mobility shift assay 
(EMSA) of A_n ? G oligonucleotide. EMS A was 
carried out as described in materials and 

30 methods. Lane 1, : A-147G oligonucleotide 

without HeLa nuclear extract; lanes 2-8 : in 
the presence of HeLa nuclear extract/ lanes 
3 and 4 : in the presence of 50 - 10O fold 
molar excess of unlabeled A-147G 

35 oligonucleotide; lanes 5 and 6; in trne 
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presence of 50 - 100 fold molar excess of 
unlabeled wild type oligonucleotide; lanes 7 
and 8 : in the presence of 1 and 2 
microlitres of anti-Spl antibody. 

5 

Fig. 9-9d : are illustrations of the results 

obtained from the x find patterns' 
program of the GCG sequence analysis 
package . 

10 

Experimental Procedures 
liver ml a xr o some preparation 

15 Human liver samples were obtained from kidney 

transplant donors, and flash-frozen immediately on 
removal. Human liver microsomes were prepared 
according to previously described protocols (21) , and 
protein content was determined by the method of Lowry 

20 as modified by Miller (22). 

Midazolam hydroxylase assay 

The rates of midazolam overall metabolism and of the 
25 formation of 1- and 4 -OH-midazolam were determined as 
follows. Each incubation vessel contained an aliquot 
of the microsomal suspension (containing 1 mg of 
microsomal protein) in 1*15 % KC1 - 0.01 M phosphate 
buffer pH 7.4; 10 jul of a stock solution of 6 mM 
30 midazolam dissolved in DMSO to reach a final midazolam 
concentration of 60 ,uM; 50,0 jul of a co-factor mixture 
containing 0.5 mg of glucose-6-phosphate , 0.5 mg of 
MgCl 2 .6H 2 0, 0.5 units of glucose-6-phosphat e 
dehydrogenase dissolved in 0.5 M Na-K-phosphate 
35 buffer, pH 7 . 4 and a 1.15 % KC1 - 0.01 M phosphate 
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buffer pH 7.4 to bring the incubation volume to 0.9 
ml. After a pre-incuba t ion for 5 min at 37°C, the 
incubations were started by adding 100 pel of a 
solution of 1.25 mg/ml NADP to reach a final 
5 concentration of 0.125 mg/ml. Tubes were continuously 
shaken at 100 oscillations/min in an Heto shaking 
waterbath. Blank incubates with boiled microsomes 
were incubated under identical conditions as the 
control incubates. The incubations were stopped after 
10 30 min by immersing the tubes in dry ice. Samples 

were stored at < -18°C until analysis. The incubation 
samples were analysed for unchanged midazolam and for 
its metabolites 1*- and 4 -hydroxymidazolam by HPLC 
with UV— detect ion. 

15 

HPLC determination of midazolam metabolites 

The 1-ml samples of midazolam were thawed and diluted 
with 1 ml DMSO. Samples were sonicated for 10 min, 

20 centrifuged and an aliquot of the supernatant: was 
injected directly onto the HPLC-column. The HPLC 
apparatus consisted of a Waters 600 MS pump. The 
samples were injected automatically, using a WISP 717 
plus automatic injector. Stainless steel columns (30 

25 cm x 4 . 6 mm i.d.) Were packed with Kromasill 18 (5 /zm) 
bound phase by a balanced density slurry proa edure 
(Haskel DSTV 122-C pump, 10 7 Pa) . UV-detecti on at 
230 nm was performed using a Waters 996 Diode Array 
Detector. Elution at 1-ml /min started with a short 

30 gradient from 100% 0.1 M ammonium acetate, pH 7.0 

(solvent system A) to 50% of solvent system A. and 50% 
of solvent system B containing 1M ammonium ac etate pH 
7.0, methanol and acetonitrile (10/45/45), over a i- 
min period, followed by a second gradient to 100% 

35 solvent system B in 15 min. This solvent com. position 
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was held for 2 min before equilibration with the 
starting conditions. The identity of the metabolites 
of midazolam was confirmed using mass spectroscopy. 
The conversion of UV-peak areas into ng was performed 
5 by a MiXlennium 2020 CDS system on a calibration curve 
of midazolam. This calibration curve was made up after 
injection of known amounts of the drug (0, 1059, 2117, 
3176 and 5028 ng) and linear (weighted by 1/x) 
regression analysis of the corresponding UV-peak 

10 areas. The equation of the calibration curve was ng = 
0.000333 x area (r 2 - 0.9997 , n = 5). The metabolic 
activity was expressed as pmol metabolite formed/min 
mg protein, and a metabolic ratio was determined for 
each sample according to the ratio of 10HM/4OHM in 

15 each sample. 

Genomic VNA preparation 

DNA was isolated from frozen liver samples using a 
20 QIAmp Tissue Kit (QIAGEN) in accordance with the 
manufacturer' s instructions. 

RNA preparation 

25 RNA was isolated from the liver samples using a QIAGEN 
RNAeasy Midi Kit (QIAGEN), according to manufacturers 
instructions. Twenty ixq of RNA was treated with RNAse- 
free DNAse I (Boehringer Mannheim), for 30 min at 37°C 
in 20 mM Tris-HCl, pH 8.0, 100 mM MgCl 2 . Samples were 

30 phenol/chloroform extracted, precipitated and 

resuspended in 30 fil of ; TE buffer. Two and a half 
of the treated sample was reverse transcribed for 50 
minutes at 42°C in 1 x first strand buffer, 0.01M DTT 
and 0.5M dNTPs using 0.5 ^g of oligo(dt) random 

35 primers and 200 units Superscript II Reverse 
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Transcriptase (GibcoBRL) for use on the ABI Prism 7700 
Sequence Detection System (SDS). 

Sequencing of the CYP3A5 5' flanking xoglon 

5 

A 1343 bp 5* flanking region of CYP3A5 was PCR 
amplified from genomic DNA isolated from liver 
samples, using primers 3A51 (5*- 
GGAAGCAACCTACATGTCCATC) and 3A52 (5'- 

10 ATCGCCACTTGCCTTCTTC) based on the published sequence 
of Jounaidi et ai. (11). PCR conditions were 1 cycle 
of 95°C for 1 min, 30 cycles of 95°C for 1 min, 57°C 
for 30 sec, 72°C for 2.5 min, and 1 cycle of ~72°C for 
10 min. PCR products were purified using a QZAquick 

15 PCR Purification Kit (QIAGEN) , sequencing primers were 
designed (Table 1), and used to directly sequence the 
PCR product on both sense and antisense strands by 
cycle sequencing using the ABI BigDye Terminator cycle 
sequencing kit (Perkin Elmer). Sequencing reactions 

20 were analysed on an ABI 377 automated sequencer. 

Contig sequences were aligned and compared using the 
Sequence Editor version 1.0.3 software packages 
(Perkin Elmer) and manually edited for identification 
of heterozygote positions. 

25 

PCR detection assays fox the A_ 147 G and. T„ 475 G arntations 

All PCR assays were performed utilising a 1 in 100 
dilution of the original 3A51/3A52 PCR product as 

30 template, under the following conditions: 1 c^cle of 
95°C for 1 min, 30 cycles' of 95°C for 1 min, 55°C for 
30 sec, 72°C for 1 min, and 1 final cycle of ~72°C for 
10 min. All products were sequenced to confi J:m the 
identity of the product as CYP3A5. Oligonucl -eotide 

35 mismatched primers utilised in the assays wer-e: 3A5F1 
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(5 ' -GGGTCTGTCTGGCTGCGC) , 3A5F2 (5 ' - 
GGGGTCTGTCTGGCTGAGC) , and 3A5R1 (5'- 

TTATGTGCTGGAGAAGGACG) , where positions of mismatches 
are unde rlined . 

For the A„ U7 G mutation, PCR was performed using primer 
pair 3A5 F2 and 3A5R1. Twenty jul of PCR product was 
digested for a minimum of 3 hours at 65°C using 15 
units of Tax I, and the restriction fragments 
visualised by ethidium bromide staining after 
electrophoresis on a 1,5% agarose gel. 

For the T.,, 5 G mutation, PCR was performed using primer 
pair 3A5F2 and 3A5R1 as described above. Twenty jul of 
PCR product was digested with 15 units of Alu I for a 
minimum of 3 hours, and restriction fragments were 
separated by electrophoresis on a 12.5% ExcelGel on a 
Pharmacia Multiphor Electrophoresis system 
(Pharmacia). Fragments were visualised by silver 
staining in a Hoeffer Automatic Gel Stainder 
(Pharmacia) . 

To detect the presence of mutations on the same 
chromosome, PCR was performed using primers 3A5F1 and 
3A5R1. Twenty ^1 of PCR product was digested for a 
minimum of 3 hours at 65°C using 15 units of Mvn I, 
and the resulting restriction fragments were 
visualised by ethidium bromide staining after 
electrophoresis on a 1.5% agarose gel. 

Relative quantification and comparison of CYP3A5 BXA 

Relative levels of CYP3A5 mRNA were determined by real 
time PCR using the ABI 7700 SDS (Perkin Elmer) . 
Optimal primers and probes for the detection of CYP3A5 
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were designed using the Pr imerExpress program, and 
subsequently checked to ensure specificity for CYP3A5. 
Primers utilised for the quantification PCR were: 
forward - 5 1 -AAGTGGCGATGGACCTCATC-3 ' ; reverse - 5 ? - 
GAGGAGCACCAGGCTGACA-3 ' . The TaqMan probe was labelled 
with the 5' reporter dye 6-carboxy- f louresine (FAM) , 
and had the sequence 5 1 -CAAATTTGGCGGTGGAAACCTGGC-3 1 ♦ 
Optimal primer/probe ratios and concentrations were 
determined and the experiments run according to 
standard protocols for the ABI 7700 Standard Detection 
System. CYP3A5 mRNA expression for all samples was 
normalised against the expression of p-actin mRNA. The 
threshold cycle (Ct) is the PCR cycle number where the 
ABI 7700 begins to detect an increase in fluorescent 
signal associated with the linear amplification of PCR 
product- The Ct value is dependent on the initial 
amount of template copy. Quantities of CYP3A5 in each 
sample were determined by averaging the Ct from 3 
separate PCR reactions of each sample. Relative 
differences in Ct between samples were calculated by 
subtracting the Ct of each sample from the highest Ct 
within the samples (lowest expression) . Since the 
amount of PCR product doubles with every cycle in the 
linear range of a PCR the differences in Ct were 
converted into estimated differences of mRNA quantity 
between the samples by calculating 2 <5ct , where <5Ct is 
the difference in cycle threshold between two samples. 

Negative controls were performed on each run to ensure 
that no signals were due to DNA contamination* Control 
samples consisted of RNA" samples which had been 
treated in exactly the same manner as for the 
quantitative PCR, but without the addition of the 
reverse transcriptase. 
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Statist; steal Analysis 

Statistical analysis was performed on the JMP 
Statistical program version 3.2.2 (SAS Institute 
Inc.). Metabolic ratio and CYP3A5 mRNA expression data 
were checked to ensure that they conformed to a normal 
distribution. CYP3A5 mRNA expression data did not 
conform to a normal distribution and were ln- 
transformec, afterwhich the data was normally 
distributed. Metabolic ratios and expression levels 
were compared between groups using a t-Test. 

Western Blot Analysis 

Forty micrograms of microsomal protein prepared from 
each liver were solubilised in an equal volume of 
Laemmli sa-ple buffer (Biorad) by four cycles of 
freezing and boiling for 10 minutes. Samples were 
loaded onto pre-cast 10% SDS-PAGE Ready Gels (Biorad) 
and electrcphoresed for 1 hour at 180 V. Separated 
proteins were transferred onto Hybond-P membranes 
(Amersham) using a Trans-blot SD apparatus (Biorad) . 
Membranes were blocked by an overnight incubation at 
4°C in lx ?5S containing 5% (w:v) nonfat milk and 0.1% 
(v:v) Tweer.. Membranes were incubated at ambient 
temperature for 1 hour in a 1:3000 dilution of 
specific hurr.an CYP3A5 antibody (Gentest) in IX PBS, 
2.5% nonfa- milk, then rinsed four times in lx PBS, 
2.5% (w:v) nonfat milk, 0.1% (v:v) Tween. Membranes 
were incubaied at ambient temperature for 1 hour in a 
1:5000 dilu-ion of Anti-Rabbit igG peroxidase 
conjugate (Sigma) in lx PBS, 2.5% (w:v) nonfat milk, 
and rinsed as previously. The membranes were 
developed using the ECL Plus Western Blotting 
Detection System (Amersham) according to 
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manuf act: urer' s instructions, and visualised by 
autoradiography using Kodak X-Omat film (sigma) . 



Example 3=. 



Mldazolaxxi pheno typing 



A panel of 39 liver samples was phenotyped for CYP3A5 
activity,, using the metabolism of midazolam to its 1- 
OH metabolite as a marker of activity. Human liver 
microsomal samples containing CYP3A5 in addition to 
CYP3A4 exhibit a significantly greater ratio of 1-OHM 
to 4-OHM compared with samples containing only CYP3A4 . 
1-OHM/4-OHM ratios between 5 and 9 were observed for 
microsomes containing both CYP3A4 and CYP3A5 . Samples 
containing only CYP3A4 showed 1-OKM/4-OHM ratios < 4 
(15) . Analysis of the CYP3A5 phenotypes in our data 
set showed a clear bimodal distribution, with 6 
samples (15%) having metabolic ratios greater then 5, 
and the remaining samples having metabolic ratios 
lying between 1.5 and 3.5 (see Fig. la) . Of the 39 
liver samples from which microsomes were prepared for 
metabolic analysis, sufficient tissue was available 
for full DNA and RNA analysis for 26, which included 6 
samples lying in the higher metabolic ratio range. In 
addition to these 26 samples microsomes for pirotein 
analysis were available for a further 3 samples, all 
of which had metabolic ratios of <4 . 



Analysis of CYP3A5 gene 5 % flanking xreg-lon 

The 5' flanking region of CYP3A5 was PCR-ampli f ied 
from genomic DNA of all 26 samples and sequenced in 
full, as shown in Figure 7. Alignment showed that the 
region was well conserved. Only a s^nall number of 
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inter-individual variations were identified in 
addition to a few variations from the published 
sequence (Table 2.). All variants detected were 
heterozygous, and all samples heterozygous for the 
more frequent A_ 147 G mutation were also heterozygous 
for the T_.- 5 G mutation, suggesting that the two 
mutations were linked. These two mutations fall 
within two separate putative regulatory elements, a 
basic transcription element (BTE : A. 147 G) and an 
activator protein-3 motif (AP-3: ?_< 75 G) . None of the 
remaining variants fell within putative regulatory 
domains . 



on 



PCR assays were developed to confirm the presence of 
the A. :47 G and T.„ s mutations individually, and to 
ascertain if the two mutations were on the same, or 
separate chromosomes. The PCR assay for the A_ X47 G 
mutation was based on the creation of a recognition 
site for the restriction enzyme Tai I by utilising an 
oligonucleotide mismatch primer (3A5R1). This primer 
introduces a Tai I recognition site only when the 
wild-type "A" nucleotide is present at position -147. 
Digestion of the 369bp product with Tai I yields 
fragments of 34 9 and 20bp for the wild-type sequence, 
whilst the product remains undigested if the mutant 
"G" nucleotide is present (Fig. 2). Similarly, for the 
detection of the T. 475 G mutation a second 

oligonucleotide mismatch primer was used (3A5F2) . This 
primer introduces a recognition site for the 
restriction enzyme Alu I when the wild-type T 
nucleotide is present at 'position -475, digesting the 
product to yield fragments of 318, 33 and 18 bp. This 
site is lost when the mutant G nucleotide is present, 
yielding digestion products of 336 and 33 bp (Fig. 3). 
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To determine if the mutations were present on the same 
chromosome a PCR assay was developed utilising two 
oligonucleotide mismatch primers (3A5F1 and 3A5R1) , 
both primers introducing recognition sites for the 
5 restriction enzyme Mvn I when the mutant nucleotides 
are present at positions -147 and -475. If the 
mutations are present on the different chromosomes 
then the original 369 bp product is digested to yield 
products of 349/350 bp and 20/19 bp (inseparable by 

10 gel electrophoresis), whilst if present on the same 

chromosome the fragment is digested to yield products 
of 330 and 20/19 bp (data not shown) . In addition to 
confirming the individual genotypes of the samples as 
determined by sequencing the two mutations were, in 

15 all cases, linked on the chromosome (data not shown). 

Relationship Jbetween CYP3A5 allelic variants, C2YP3A5 
mediated metahollsm, CYP3A5 mRNA and protein 
expression 

20 

Samples were grouped according to genotype: x 'W±.ld- 
type" or "nuitant" (containing the linked 
polymorphisms), and the 1-OHM/4-OHM metabolic, ratios 
(mr) were compared between the groups (Fig. 4a) . With 

25 the exception of one outlier (liver sample numboer, mr 
= 2.08), all individuals carrying the linked mutations 
had metabolic ratios > 5.0, whilst the wild typoe group 
all possessed metabolic ratios of < 3.5. The mean 
metabolic ratios for the mutant group were 

30 significantly higher than 'those from the wild-type 
group (6.0 ± 2,0 versus 2.7 + 0.42, mean + standard 
deviation; p < 0.001). 

Quantitative PCR was used to ascertain if the 
35 mutations ;r the 5' flanking region were affecting 
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gene expression. Whilst mRNA levels showed greater 
variation than the metabolic data, a degree of 
bimodalit y was observed (Fig. lb). The mutant group 
had CYP3A 5 mRNA levels skewed towards the higher end 
of the expression range, showing significantly higher 
levels of CYP3A5 mRNA than the wild type group (mean 
InCt - 4. 03, standard deviation = 0.97, against mean 
InCt = 2. 06, standard deviation =1.2, p < 0.006) 
((Fig. 4b) - In this case the outlier (presenting with 
the mutant: genotype, but wild type metabolic ratio) 
also fell within the lower range of expression ( InCt = 
2.9). 

The level of CYP3A5 protein expression levels was 
determined for 29 liver samples by Western blot 
analysis using a CYP3A5 specific antibody. A single 
band of 52 kDa corresponding to CYP3A5 was clearly 
apparent in some samples . With the exception of the 
single outlier with the high expression genotype 
(mutant) and low metabolic ratio phenotype (wild- 
type), all samples which possessed the high expression 
genotype, £ high metabolic ratio and high RNA 
expression level clearly show high levels of CYP3A5 
expression when compared to those samples with the low 
expression genotype and phenotype (Fig. 5) . The 
single outlier with the high expression genotype, but 
low expression phenotype showed levels of CYP3A5 
expression similar to those in the low expression 
genotype grc-p. Longer exposure of the Western blot 
indicated that a very low level of CYP3A5 expression 
was apparent in most samples (data not shown). 

The 5' flanking sequences of CYP3A5 obtained in this 
study are virtually identical to those published by 
Jounaidi et al. (11), and sh~w little inter-indi vidual 
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variation in sequence. Interestingly, Jounaidi et al. 
sequenced two human genomic clones, one of which 
contained the two linked mutations described in detail 
in this report. This would suggest that one clone was 
5 derived from an individual in the low expression 
group, and one from an individual in the high 
expression/metabolism group. 

Previous studies had suggested that CYP3A5 was 

10 expressed in 10-30% of livers (7, 8, 9) whilst another 
study has stated that some expression is constitutive 
in all samples (10). The present study supports the 
findings that some CYP3A5 expression is constitutive, 
with some metabolic activity and mRNA being detected 

15 in. all livers studied, although CYP3A5 protein was not 
convincingly demonstrated in all samples using the 
procedures required. We detected enhanced RNA and 
protein expression in 23% of the samples for which 
tissue was available (6 out of 26), which is similar 

20 to the fraction of liver showing expression in 

previous studies. This supports the finding of Boobis 
et al. (10) that some show low level expression is 
constitutive in ail liver samples although this can 
only be detected using more sensitive detection 

25 techniques (such as PCR, and not by Western or 
Northern blot analysis) . 

Whilst both polymorphisms detected lie within putative 
transcriptional regulatory elements, we suspect that 

30 the variant within the BTE , is more likely to be 

responsible for altered expression since it has been 
reported that a BTE flanking the TATA box accounts for 
the constitutive expression of CYP1A1, and a similar 
region has oeen found in several other CYP qer\&s 

35 including CYP2B1, CYP2B2 , CYP2E1 (16) CYP3A4 ( 1_ 3 ) and 
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CYP3A7 (3-2). In the case of CYP3A4 gene this element 
has been shown to bind nuclear extracts (13) and a 
basic tra nscript ion element binding factor for CYP3A7 
(12), pointing to a role of this region in the general 
5 control of cytochrome P450 expression. The exact 

mechanism of up-regulation of CYP3A5 expression in the 
allelic variant described here remains to be 
determined although the presence of one of the 
mutations within the BTE, and the relevance of this 

10 element for the expression of other P450s indicates a 
possible mechanistic link. Using methylation 
interference footprint ing, it has been shown that all 
guanine residues within the BTE, and other guanine 
residues in the vicinity, interacted with the 

15 transcriptional factor Spl (19) . Given that the 

mutation within the BTE (Spl) described herein alters 
an adenine residue to a guanine residue, then this 
could facilitate binding of transcription factors to 
the variant form of the Spl. 

20 

Although there is considerable overlap in the range of 
CYP3A5 mRNA levels seen in the homozygous and 
heterozygous group, the distribution of metabolic 
ratios is clearly bimodal, as is the amount of CYP3A5 . 

25 We cannot exclude the presence of other polymorphisms 
that may affect the translation efficiency or protein 
stability of CYP3A5. But given the better correlation 
between DNA polymorphism and protein level and the 
notorious liability of RNA, the simpler explanation is 

30 that differential RNA degradation or yield (due to 
differences in sample handling) has blurred the 
distinction between high and low expressers. Whatever 
the explanation for the discrepancy at the mRNA level, 
it does not in any way diminish the predictive value 

35 of the DNA polymorphism described. 
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There is, however, one individual whose genotype 
{heterozygous mutant) is not predictive of his 
metabolic phenotype (low expression). The fact that 
CYP3A5 protein as well as mRNA levels were low in this 
5 outlier indicates that the explanation must be sought 
at the transcriptional level, e.g. in transcription 
factors controlling CYP3A5 expression. 

An AUG element in the 5 1 - untranslated region of the 

10 BTEB gene has been shown to be, at least in part, 

responsible for cell specific translational control .of 
BTEB (20) . Mutations within this region were shown to 
affect BTEB translation. Therefore, whilst the outlier 
in our study has a high expression genotype fox CYP3A5 

15 expression, this individual may have a "poor" 

expression phenotype for BTEB. Additionally, dt is 
possible that a mechanism similar to that responsible 
for inducing CYP1A1 expression may also affect CYP3A5 
expression. In addition to the BTE, CYP1A1 expression 

20 is mediated by a xenobiotic responsive element (XRE) . 

In this case inducers enhance expression by binding to 
a cytosolic receptor (Ah receptor) which is 
translocated into the nucleus (possibly in association 
with an accessory protein coded for at the Arnt gene) , 

25 where it binds the XRE (17, 18). Although variations 
in these and other transcription factors could further 
modulate CYP3A5 expression, this does not detract from 
the fact that the polymorphism described here seems to 
be the major determinant of CYP3A5 expression, at 

30 least in liver. 

Despite the relatively small number of samples 
available for analysis in the present study, s trong 
associations have been found between the two Linked 
35 polymorphisms on the one hand and both expression and 
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CYP3A5 mRNA, protein and activity levels in the liver 
on the other hand. The unravelling of a genetic 
mechanism for the polymorphic metabolism by CYP3A5 
will have important consequences in the field of 
pharmacogenetics. The ability to predict metabolism by 
genotypin<g will greatly facilitate disease association 
studies and may also help to explain adverse reactions 
or poor response to therapeutics which are metabolised 
by this cytochrome P450 isoform. It will also help in 
delineating which factors affecting CYP3A5 activity 
are genetic and which are environmental; for both 
further work will be required to fully understand the 
complex variation in expression observed with this 
enzyme . 

Putative promoter sequence analysis 

Materials and Methods: 

The sequence of the regulatory region of CYP3A5 was 
analysed with the ' f indpatterns ' program of the GCG 
sequence analysis package (GCG, Madison, Wisconsin) . 
This program finds specific DNA sequence motifs, 
patterns, and transcription binding sites, whose 
sequences are stored in the program, and are present 
in the sequence of interest. In the present analysis, 
at most one single mismatch or error per pattern is 
allowed in the sequence of interest, to detect if the 
two reported variations alter any known motifs or 
transcription binding sites. Results are identified 
in Figures 9 to 9d. 

The first, GCGTG to GCTTG variation 

removes binding sites for MBF-I_CS, MRE_CS2, and 
CNBP-SRE. 
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The second, CCACC to CCGCC variation 

replaces binding sites for apoE-undef ined-site-3, 
ApoE_Bl, APRT-CHO_US, and APRT~human_US 
by GCF-consensus, APRT-mouse_US, GC-box_(l), 
5 DSE_(1), Spl_CS4, Spl-hsp70_(l) , hsp70.2, Spl-IE- 

3.3, Spl-IE-4/5, IRE_(1), Spl-TPI__(4) 
does not affect the Yi-consensus pattern 

Both mutations affect transcription factor binding 
10 sites. 

Electrophoretic mobility shift assay {EMSA) 

An EMSA was carried out using the Spl NUSHIFT Kit from 

15 Geneka Biotechnology Inc. (Montreal, Canada) according 
to the manufactures instructions. Briefly, a 31-mer 
double-stranded oligonucleotide corresponding to the 
CYP3A5 5 ' -untranslated region containing the A, 1<J7 G 
polymorphism ( 5 1 -GGC AGC TGC AGC CCC GCC TCC TTC TCC 

20 AGC A-3 ' ) was end-labeled with 32 -P using T4 
polynucleotide kinase. 50,000 cpm (0.5 ng) 
oligonucleotide was incubated with 2 jag HeLa nuclear 
extract for 30 min at 16°C. Unlabeled mutant or 
wiidtype (5-* GGC AGC TGC AGC CCC ACC TCC TTC TCC AGC 

25 A-3') oligo nucleotide was added in 50~fold or 100- 
fold excess as indicated, 1 or 2 pi anti-Spl rabbit 
polyclonal antibody was pre-incubated with the nuclear 
extract at 4°C for 30 min as indicated. Nuclear 
extract, anti-Spl antibody and binding buffers were 

30 from Geneka Biotechnology Inc. Samples were separated 
on a 5% polyacrylamide (39:1) gel, in TGE buffer (25 
mM Tris, 190 mM glycine, 1 mM EDTA, pH 8.3). The dried 
gel was exposed to X-ray film. 



35 
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RESULTS 



Analysis of the 5 ' -untranslated region of the CYP3A5 
gene indicated that the A. U ,G polymorphism might 
create a binding site for the transcription factor 
Spl. An electrophoretic mobility shift assay (EMSA) 
was carried out to test this hypothesis. An oligo 
nucleotide containing the A. !47 G polymorphism was used 
to assay for binding factors present in HeLa nuclear 
extracts. A band shift was observed (Figure 8, lane 2) 
which was competed away with 50- and 100-fold excess 
respectively of unlabeled oligo nucleotide (Figure 8, 
lanes 3 and A), but not with wildtype oligo nucleotide 
(Figure 8, lanes 5 and 6) . This clearly indicates the 
presence of a protein factor in HeLa nuclear extracts 
capable of binding to the A. M7 G polymorphism region, 
but not to the wildtype region. Incubations in the 
presence of an antibody specific for the transcription 
factor Spl resulted in supershif ting of the A. lA1 G 
polymorphism oligo nucleotide (Figure 8, lanes 7 and 
8), indicting that Spl is binding to the A.,,-G 
polymorphism site. 

This change in binding affinity of transcription 
factor Spl to the 5 ' -untranslated region of the CYP3A5 
gene might account for the increase in transcription 
from the A_ 141 G polymorphic promoter and in turn, might 
contribute to the increase in metabolic rates 
correlated with the A_ 147 G polymorphisms. 

Genotyping of the cytochrome expression 

A group of 300 healthy Caucasian volunteers was 
genotyped for variations T. 4 , 5 > G and A.. 47 >G of the 
cytochrome P4 50 3A5 gene. 
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Test rationale 

The first objective concerned allele/genotype 
f requencies . 

Because the initial study included only 30 to 35 
different individuals, a 1 lei le /genotype frequencies 
could not be determined, Genotyping a group of 300 
subjects should permit determination of these 
frequencies and to check whether they are in agreement 
with the Hardy-Weinberg equilibrium. 

The second objective concerned the linkage of the two 
variations. In the initial study, all samples with the 
gene variations T_ 4 , 5 > G and A_ 14? > G (only 6 in total) 
were linked. To verify the suggested linkage, both of 
these GYP 3A5 polymorphisms were genotyped on a larger 
population . 

Materials and methods 

In order to minimize genotyping errors, genomic DNA 
samples from 300 healthy Caucasian volunteers were 
genotyped in a microti terplate based format, which 
ensured a blind and completely independent duplicate 
analysis of each individual sample. 
A 1343 bp 5' flanking region of CYP3A5 was PCR- 
amplified from genomic DNA using primers 3A51/3A52. 
PCR assays for both variations were performed using a 
1/100 dilution of the original 3A51/3A52 PCR product 
as template. Mismatch primers 3A5F2 and 3A5R1 were 
utilised for both assays. For the A_ 147 > G mutation the 
PCR product was digested with restriction enzyme Tai 
I, and for the T^ 75 > G mutation the PCR product was 
diges-ed wi~h restriction enzyme Alu I. After 
digestion the restriction fragments were separated by 
polyacrylamide gel electrophoresis and visualised by 
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silver staining. The genotypes were determined based 
on the DNA fragment patterns by two independent 
observers . 

Results 

1. Allelle/genotype frequencies 

In the population of 300 individuals, 53 heterozygous 
subjects (18%) were carrying one copy cf each of the 
variations, 246 subjects (82%) were homogenious for 

A„ 147 and T.^,, and one individual (0.3%) was carrying 

variations G. : « 7 and G_ 475 on both allelles (homozygous). 

These frequencies are in agreement with 3A5 expression 

found in previous studies (7,8,9) 

The allelle frequencies are in agreement with the 

Hardy-Weinberg equilibrium (Table 3). 

2. Linkage cf variations T„ 475 > G and A_ u - > G 

In all individuals, respectively variations T„ 47 = and 
A_ l47 , and variations G„ 47£ and G_ 147 , were equally 

represented in genotypes, indicating a strong linkage 
between both variations. Whether this linkage between 
both variations has some functional significance needs 
to be clarified further. As a consequence of the 
linkage, future genotyping will require only the 
analysis of one of the variations, whether it is the 
functional variant or not. 
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Table 1. Primers used for sequencing 5' flanking 
region of CYP3A5 from PCR product 3A51/3A52 (see 
text) . 



10 



15 



Primer 


Orientation^ 


Position* 


Sequence (5 ! -3 f ) 


3A51 


F 


-1237- -1217 


GGAAGCAACCTACATGTCCATC 


3A5p01 


F 


-978- -963 


AGTACAGGGAGCACAG 


3A5p08 


R 


-917- -932 


CACCTATTCATTCCTG 


3A5p02 


F 


-698- -684 


TGCTATCACCACAGAC 


3A5p07 


R 


-689- 704 


GGTGATAGCAATAGAC 


3A5p03 


F 


-364- -349 


AGGATGTGTAGGAGTC 


3A5p06 


R 


-417- -434 


CCTCACACAGATGTAACC 


3A5p04 


F 


-176- -161 


TAAGAACTCAGGTTCC 


3A5p05 


R 


-178- -194 


CAGAAACTGAAGTGGAG 


3A52 


R 


+105- +87 


ATCGCCACTTGCCTTCTTC 



20 



25 



f F = 5' to 3', R = 3' to 5' 

* Primer locations are based on CYP3A5 sequence data 
of Jounaidi et al (11) 



30 
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Table 2 . 



Position 


Variant Sequence 


Percentage 


-475 


T-K (T or G) heterozygote 


30.6% (11/36) 


-147 | A-R (A or G) heterozygote 


30.6% (11/36) 



TABLE 3. 

Hardy Weinberg Equilibrium test Jest: CYP3A5 -45 A>G 

Population: CON-JRF-1 





Observed values 




Expected 




N 


freq 


N 


freq 


genotype AA 


246 


0.820 


247.5 


0.825 


genotype AG 


53 


0.177 


50.0 


0.167 


genotype GG 


1 


0.003 


2.5 


0.008 


total 


300 


1 


300 


1 



1.112 - Chi-square (Pearson) 
0.292 = p-value 
1 = d.f. 

N freq 
Allele A 545 0.908 

Allele G 55 0.092 
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CLAIMS 

1. A method of identifying subjects having a 
high or low drug metabolising phenotype associated 
with cytochrome CYP3A5 expression, which method 
comprises the steps of: 

screening genomic DNA from said subject for the 
presence or absence of one or more polymorphic 
variants in a transcription regulatory region of the 
sequence encoding CYP3A5 characteristic of a high drug 
metabolising phenotype. 

2. A method of screening human subjects for 
suitability for treatment with a drug metabolised by 
CYP3A5 comprising screening for the presence or 
absence of one or more polymorphic variants in a 
transcription regulatory region of the sequence 
encoding CYP3A5 characteristic of a high drug 
metabolising phenotype . 

3. A method according to claim 1 or 2 
comprising screening for said one or more variants in 
a recognition site for a transcription factor of said 
regulatory region. 

4. A method according to any of claims 1 to 3 
comprising screening for said one or more variants in 
an activator protein-3 motif (AP-3) and/or basic 
transcription element (BTE) . 

5. A method according to any of claims 1 to 4 , 
comprising screening for said one or more variants at 
any one of positions -475 or -147 of the transcription 
regulatory region of the sequence encoding CYP3A5 the 
sequence of which region is illustrated in Figure 7. 
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6. A method according to claim 5 comprising 
screening for both of said variants at position -475 
or -147 of said transcriptional regulatory region of 
CYP3A5. 



7. A method according to any of claims 1 to 5 
wherein said DNA is amplified using oligonucleotide 
molecules which are capable of hybridising selectively 
to the wild type or variant sequences respectively 
such that generation of amplified DNA from said 
respective molecules will indicate whether said wild 
type or said variant is present. 

8. A method of identifying one or more 
polymorphic variants in a transcription regulatory 
region of DNA encoding cytochrome CYP3A5 said method 
comprising the steps of: 

1) subjecting the sample DNA to amplification 
using oligonucleotide molecules which are 
capable of selectively hybridising to the 
wild type and/or said one or more variant 
sequences, which molecules are such that 
they can introduce a restriction site in one 
of said amplified wild type or variant 
sequences , and 
2) subjecting amplified DNA from step 1 to 

restriction with an enzyme which cleaves at 
said restriction site to provide a 
restriction digest indicative of the 
presence or absence of said mutation. 

9. A method according to claim 8 wherein said 
molecule introduces a restriction site in a region 
corresponding to a recognition site for a 
transcription factor of said regulatory region. 
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10. A method according to claim 8 or 9 wherein 
said molecule introduces a restriction site in a 
region corresponding to an activator protein-3 motif 
(AP-3) and/or a basic transcription element (BTE) . 

5 

11. A method according to claim 10 wherein said 
molecule is capable of introducing a restriction site 
only when the wild type A nucleotide is present at 
position -147 of the transcription regulatory region. 

12. A method according to claim 11 wherein said 
restriction site is for the Tai I restriction enzyme. 

13. A method according to claim 11 or 12 wherein 
15 said oligonucleotide molecule comprises the sequence 

designated 3A5R1 illustrated in Figure 6. 

14. A method according to claim 10 wherein said 
molecule is capable of introducing a restriction site 

20 when the wild type T nucleotide is present at position 
-475 of the regulatory control region. 

15. A method according to claim 14 wherein said 
restriction site is for the Alu I enzyme. 

25 

16. A method according to claim 14 or 15 wherein 
said molecule comprises the sequence designated 3A5F2 
illustrated in Figure 6. 

30 17. An oligonucleotide molecule of at least 10 

contiguous nucleotides for use in amplification of a 
DNA sequence to detect a wild type or polymorphic 
variant in a transcription regulatory region of the 
sequence encoding cytochrome CYP3A5 said associated 

35 with a high or low drug metabolising phenotype 



WO 00/39332 



- 44 - 



PCT/GB99/04380 



respectively, which molecule is capable of hybridising 
to a region incorporating either a polymorphic variant 
or wild type nucleotide in said region, such that 
amplification of said wild type and polymorphic 
variants will proceed from said molecule only when an 
oligonucleotide includes a sequence corresponding to 
either said wild type or polymorphic variant 
characteristic of a high drug metabolising phenotype. 

18. A molecule according to claim 17 which is 
capable of hybridising to a recognition site for a 
transcription factor of said regulatory region. 



19. A molecule according to claim 17 or 18 which 
is capable of hybridising to an activator protein-3 
motif (AP-3) or a basic transcription element. 

20. A molecule according to any of claims 17 to 

19 which is capable of hybridising to a region 
comprising a polymorphic variant at any of positions - 
475 or -147 of the transcription regulatory region of 
the sequence encoding CYP3A5 illustrated in Figure 7. 

21. A molecule according to any of claims 17 to 

20 which comprises any of the sequences designated 
3A5F1, 3A5F2 or 3A5R1 illustrated in Figure 6. 

22. a kit for performing the method of any of 
claims 1 to 7 comprising an oligonucleotide molecule 
according to any of claims 17 to 21 and means for 
contacting said molecule- and said transcription 
regulatory region of the 'sequence encoding CYP3A5 . 

23. a kit according to claim 22 further 
comprising a restriction enzyme capable of producing a 
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restriction digest for distinguishing between said 
variant or wild type sequences. 

24. A kit according to claim 23 wherein said 
enzyme comprises any of Tai I or Alu I. 

25. A method of identifying toxic or mutagenic 
effects of a test compound; such as, a drug, toxin or 
procarcinogen metabolised by CYP3A5 the method 
comprising contacting each of a cell having a high 
drug metabolising phenotype and a cell having a low 
metabolising phenotype associated with cytochrome 
CYP3A5 expression, with said test compound and 
identifying the effects of said compound on each of 
said high or low drug metabolising phenotype cells or 
other cells sensitive to said compound. 

26. A method of diagnosing susceptibility of an 
individual to a disease associated with environmental 
toxins or procarcinogens metabolised by CYP3A5, which 
method comprises screening for the presence or absence 
of a polymorphic variant in a transcription regulatory 
region of the sequence encoding CYP3A5. 

27. A method according to claim 26 comprising 
screening for said variant in a recognition site for a 
transcription factor of said regulatory region. 

28. A method according to claim 26 or 27 
comprising screening fo^ said variant in an activator 
protein-3 motif (AP-3) ahd/or a basic transcription 
element (BTZ) of said transcription regulatory region. 

29. A method according to any of claims 26 to 
28, comprising screening for said variant at any one 
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of position -475 or -147 of the transection 
regulatory region of the sequence encoding CY-3A5 the 
sequence of which region is illustrated in Figure 7. 

30. A method according to any of cl aims 26 to 29 
-4 0 7 5 r or-L7 Creening Var±antS 3t P ° Siti - 



31. A method according to any of claims 26 to 30 
comprising screening for the presence or absence of 
variants t_,,g and A. 14?G in said transactional 
regulatory control region. 

32. A method of providing a measure of activity 
of a transcription regulatory region of a DNA sequence 
encoding cytochrome CYP3A5 or of identifying a 
polymorphic variant which alters transcription of 
cytochrome CYP3A5 , the method comprising providing a 
DNA construct having a sequence encoding a reoorter 
molecule operably li nke d to a DNA fragment comprising 
said transcription regulatory region, and introducing 
sard construct into a cell and monitoring for the 
level of expression of said reporter molecule. 

33. A method of identifying transcription 
factors capable of hybridising to a DNA sequence from 
a transcription regulatory region adjacent to DNA 
encoding cytochrome CYP3A5, said method comprising 
contacting said DNA sequence including said 
transcription regulatory region with potential 
transcription factors and, identifying any 
transcription factor complexed to said DNA sequence. 

„ , 34 " A meth ° d ° f identifying compounds acting on 
a transcription regulatory region of a DNA sequence 
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encoding CYP3A5, the method comprising transforming a 
cell with a DNA construct comprising the sequence of 
said regulatory region, and which regulatory region is 
operably linked to a sequence encoding a reporter 
molecule, contacting said cell with a test compound 
and identifying any altered expression of said 
reporter molecule. 

35. A method of purifying transcription factors 
from a sample which are capable of binding to DNA from 
a transcription regulatory region of a sequence 
encoding cytochrome CYP3A5 ; the method comprising 
contacting a DNA sequence including said 
transcriptional regulatory region with a mixture of 
transcription factors and identifying any complexes of 
said transcription factors and said sequence. 

36. A method according to any of claims 32 to 35 
wherein said transcription regulatory region includes 
a mutation in a recognition site for a transcription 
factor of said regulatory region. 

37. A method according to any of claims 32 to 36 
wherein said mutation occurs in an activator protein-3 
motif (AP-3) and/or a basic transcription element 
(BTE) . 

38. A method according to any of claims 36 or 37 
wherein said mutation occurs at any one of positions 
-475 or -147 of the transcription regulatory region 
adjacent to the sequence Encoding CYP3A5, the sequence 
of which region is illustrated in Figure 7. 

39. A method according to any of claims 32 to 38 
wherein the transcription rerr-latory region comprises 
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the mutations T_, 75 G and A_ 147 G. 
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3A5R1 5'TTTATGTGCTGGAGAAGGACG-3' 
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